
Sequenced molecule: genomic y#NA 
Name: GBSS gene fragment £rom potato 
Length of sequence: 342 >bp 



7GCATGTTTC CCTACA77C7 A7T7AGAATC GTGTTGTGGT GTATAAACGT 
TGTTTCATAT CTCATCTCAT C^A77C7GA7 777GA7TC7C TTGCCTAC7G 
TAATCGGTGA TAAATGTGAA 7GC77CC777 CTTCTCAGAA A7CAA777C7 
G7777G777T 7G77CA7C7G/7AGC77A77C 7C7GG7AGA7 7CCCC77777 
GTAGACCACA CA7CAC A7$/ GCA AGC A7C ACA GC7 7CA CAC CAC 

Ala Ser lie Thr Ala Ser His His 
5 



50 
100 
150 
200 
243 



77T G7G 7CA AGA XGC CAA AC 7 7CA CTA GAC ACC AAA 7CA ACC 
?he Val Ser Arg/ Ser Gin Thr Ser Leu Aso Thr Lys Ser 7hr 
10 / 15 20 



285 



77G 7CA CAG A 7 A GGA C7C AGG AAC CA7 AC 7 C7G AC 7 CAC AA7 
Leu Ser Gin/ He Gly Leu Arg Asn His Thr Leu Thr His Asn 
25 / 30 35 



327 



GG7 T7A AGG GC7 G7T 
Gly Leu/ Arg Ala Val 
40 



342 





V 




Sequenced molecule: genomic pNA 
Name: GBSS gene fragment frpm potato 
Length of sequence: 2549 

AAC AAG CTT GAT GGG CTC CAA TCA f(CA ACT AAT ACT AAG GTA 
Asn Lys Leu Aso Gly Leu Gin Ser /Thr Thr Asn Thr Lys Val 
45 * 50/ 55 

ACA CCC AAG ATG GCA TCC AGA A(£t GAG ACC AAG AGA CCT GGA 
Thr Pro Lys Met Ala Ser Arg Thr Glu Thr Lys Arg Fro Gly 

60 / 65 70 

TGC TCA GCT ACC ATT GTT TG^ GGA AAG GGA ATG AAC TTG ATC 
Cys Ser Ala Thr lie Val Cys Gly Lys Gly Met Asn Leu lie 

75 7 80 

TTT GTG GGT ACT GAG GTT VGGT CCT TGG AGC AAA ACT GGT GGA 
tsv e va 1 '~- 1 y Thr Glu Val/Gly Pro Tro Ser Lys Thr Gly Gly 
85 9(7 ~ 95 

CTA GGT GAT GTT CTT G^T GGA CTA CCA CCA GCC CTT GCA 
Leu Gly Aso Val Leu Gly Gly Leu Pro Pro Ala Leu Ala 
100 ' / 105 110 

GTAAGTC7TT CTTTCATTjfrG GTTACCTACT CATTCATTAC TTATTTTGTT 
TAGTTAG7 7T CTACTGC^TC AGTCTTTTTA TCATTTAG GCC CGC GGA 

Ala Arg Gly 



42 



84 



126 



168 



207 



257 
304 



CAT CGG GTA ATG ACA ATA TCC CCC CGT TAT GAC CAA TAC AAA 
His Arc Val Met/ Thr lie Ser Pro Arg Tyr Aso Gin Tyr Lys 
115 ~ I 120 125 



346 



GAT GCT TGG GAT ACT GGC GTT GCG GTT GAG 
Aso Ala Tro Aso Thr Gly Val Ala Val Glu 
130 ' / * 135 



GTACATCTTC 



386 



CTATATTGA? A C G G TAC AAT ATTGTTCTCT TACATTTCCT GAT TC AAG AA 
TGTGA7CA7/C TGCAG GTC AAA GTT GGA GAC AGC ATT GAA ATT GTT 

Val Lys Val Gly Aso Ser lie Glu He Val 
14 0 14 5 



436 
481 



CGT TTG TTT CAC TGC TAT AAA CGT GGG GTT GAT CGT GTT TTT 
Arc Ph4 ?he His Cys Tyr Lys Arg Gly Val Asp Arg Val Phe 



523 



23 



TATGATTATG AATCCGTCCT GAGGGATACG CAGAACA^GT CATTTTGAGT 
ATCTTTTAAC TCTACTGGTG CTTTTACTCT TTTAAG /GTT TGG GGC AAA 

'Val Tro Gly Lys 
175 



610 
658 



ACT GGT TCA AAA ATC TAT GGC CCC AAA GCT GGA CTA GAT TAT 
Thr Gly Ser Lys lie Tyr Gly Pro Lys Ala Gly Leu Asp Tyr 

180 485 



700 



CTG GAC AAT GAA CTT AGG TTC AGC TTG TTG TGT CAA 
Leu Aso Asn Glu Leu Arg Phe Ser Leu Leu Cys Gin 
190 " 155 / 200 



736 



GTAAGTTAGT TACTCTTGAT TTTTATGTGG CATTTTACTC TTTTGTCTTT 
AATCGTTTTT TTAACCTTGT TTTCTCAG/GCA GCC CTA GAG GCA CCT 

Ala Ala Leu Glu Ala Pro 
205 



786 
832 



AAA GTT TTG AAT TTG AAC AGT AGC AAC TAC TTC TCA GGA CCA 
Lys Val Leu Asn Leu Asn Ser/ Ser Asn Tyr ?he Ser Gly Pro 
210 / 215 220 



874 



TAT G 
Tyr G 



G T AA T T AAC A CATGCTAGTT TCAGAAAACT CCTTACTATA 



918 



TCATTGTAGG TAATCATCTT TjATTTTGCCT ATTCCTGCAG GA GAG GAT 

ly Glu Aso 
225 



966 



GTT CTC TTC ATT GCC A/AT GAT TGG CAC ACA GCT CTC ATT CCT 
Val Leu Phe lie Ala Asn Aso Tro His Thr Ala Leu lie Pro 

230 / * ~ 235 



1008 



TGC TAC TTG AAG TCA ATG TAC CA<G TCC AGA GGA ATC TAC TTG 
Cys Tvr Leu Lys Ser Met Tyr Gin Ser Arg Gly lie Tyr Leu 
240 * / 245 250 



1050 



AAT GCC AAG 
Asn Ala Lys 
255 



GTAAAATTTC TTTGTATTCA CTCGATTGCA 



1089 



CGTTACCCTG CAAATCAGTA AGGTTGTATT AATATATGAT AAA T T T CAC A 
TTGCCTCCAG GTT GCT TTC TGC ATC CAT AAC ATT GCC TAC CAA 
Va/i Ala Phe Cys He His Asn He Ala Tyr Gin 
260 265 



1139 
1182 



GGT CGA TTT/ TCT TTC TCT GAC TTC CCT CTT CTC AAT CTT CCT 
Gly Ar a Ph£ Ser Phe Ser Aso Phe Pro Leu Leu Asn Leu Pro 
27/0 275 280 



1224 



:ct 



'T GAT 



1 £OS> 



24 



AGTAAATTGA GTTTTTAAAA TTTTGCAGAT ATGAG/AAG CCT GTT AAG 

Lys Pro Val Lys 
295 



1360 



GGT AGG AAA ATC AAC TGG ATG AAG GCT /GGG ATA TTA GAA TCA 
Gly Arg Lys lie Asn Tro Met Lys Ala/ Gly lie Leu Glu Ser 
300 * 305 / 310 



1402 



CAT AGG GTG GTT ACA GTG AGC CCA fhC TAT GCC CAA GAA CTT 
His Arg Val Val Thr Val Ser Pro /Tyr Tyr Ala Gin Glu Leu 
315 320/ 325 



1444 



GTC TCT GCT GTT GAC AAG GGA GT'T GAA TTG GAC AGT GTC CTT 
Val Ser Ala Val Asd Lys Gly Val Glu Leu Aso Ser Val Leu 
330 * / 335 340 



1486 



CGT AAG ACT TGC ATA ACT QGGf ATT GTG AAT GGC ATG GAT ACA 
Arg Lys Thr Cys lie Thr Gly lie Val Asn Gly Met Asp Thr 

345 / 350 



1528 



CAA GAG TGG AAC CCA GCG ACT GAC AAA TAC ACA GAT GTC AAA 
Gin Glu Tro A.sn Pro Ala /Thr Asp Lys Tyr Thr Aso Val Lys 
355 * 360/ 365 



1570 



TAC GAT ATA ACC ACT 
Tyr Aso He Thr Thr 
370 



£ T AAG AT AAG ATTTTTCCGA CTCCAGTATA 



1615 



TACT AAA: I A TTTTGTATG/T TTATGAAATT AAAGAGTTCT TGC T AAT CAA 
AATCTCTATA CAG GTC ATG GAC GCA AAA CCT TTA CTA AAG GAG 
Val pet Asp Ala Lys Pro Leu Leu Lys Glu 
7375 380 



1665 
1708 



GCT CTT CAA GCA 
Ala Leu Glr. Ala 
355 



;A GTT GGC TTG CCT GTT GAC AAG AAG ATC 
.a Val Gly Leu Pro Val Aso Lys Lvs He 
390 * 3S5 



1756 



CCT TTG ATT GGO TTC ATC GGC AGA CTT GAG GAG CAG AAA GGT 
Pro Leu He Gly Phe He Gly Arg Leu Glu Glu Gin Lys Gly 
400 / 405 410 



1792 



TCA GAT ATT OTT GTT GCT GCA ATT CAC AAG TTC ATC GGA TTG 
Ser Aso He Leu Ala Val Ala He His Lys Phe He Gly Leu 
15 420 425 



1834 



GAT GTT CAJV ATT GTA GTC CTT 
Aso Val Zlfi lie Val Val Leu 

430 



GTAAGTACCA AATGGACTCA 



1875 



TGGTA7C7JT CTTGTTGAGT TTACTTGTGC CGAAAC7GAA ATTGACCTGC 
7AC7CA7 7a 7 ATGCATCAG GGA ACT GGC AAA AAG GAG TTT GAG 

Gly 7hr Gly Lys Lys Glu Phe " 



1925 
1965 




CAG GAG ATT GAA CAG CTC GAA GTG TTG/TAC CCT AAC AAA GCT 
Gin Glu lie Glu Gin Leu Glu Val Le/i Tyr Pro Asn Lys Ala 

445 / 450 



2010 



AAA GGA GTG GCA AAA TTC AAT GTC 
Lys Gly Val Ala Lys ?he Asn V&J. 
455 460 



CCT TTG GCT CAC ATG ATC 
Pro Leu Ala His Met He 
4 65 



2052 



ACT GCT GGT GCT GAT TTT ATG/ TTG GTT CCA AGC AGA TTT GAA 
Thr Ala Gly Ala Asp Phe Me£ Leu Val Pro Ser Arg Phe Glu 
470 * 4^5 480 



2094 



CCT TGT GGT CTC ATT CAG 7 TTA CAT GCT ATG CGA TAT GGA ACA 
Pro Cys Gly Leu He Glfo Leu His Ala Met Arg Tyr Gly Thr 
4 55 / 4 90 4 95 



2136 



GTAAGAACCA GAAGAGCT^G TACCTTTTTA CTGAGTTTTT AAAAAAAGAA 
TCATAAGACC TTGTTTTlZCA TCTAAAGTTT AATAACCAAC TAAATGTTAC 
TGCAGCAAGC TTTTCA/TTC TGAAAATTGG TTATCTGATT TTAACGTAAT 
CACATGTGAG TCAG 0TA CCA ATC TGT GCA TCG ACT GGT GGA CTT 

il Pro He Cys Ala Ser Thr Gly Gly Leu 
500 505 



2186 
2236 
2286 
2330 



GTT GAC ACT GJ?G AAA GAA GGC TAT ACT GGA TTC CAT ATG GGA 
Val As© Thr yal Lys Glu Gly Tyr Thr Gly Phe His Met Gly 
.0 515 520 



2372 



GCC TTC AAT GTT GAA GTATGTGATT T T AC ATC AAT TGTGTACTTG 
Ala Phe .-.?T; Val Glu 

525 



2417 



TACATGGrC C ATTCTCGTCT TGATATACCC CTTGTTGCAT AAACATTAAC 
TTATTGG^TC TTGAATTTGG TTAG TGC GAT GTT GTT GAC CCA GCT 

Cys As© Val Val As© Pro Ala 

530 



2467 
2512 



GAT /GTG CTT AAG ATA GTA ACA ACA GTT GCT AGA GCT C 
Asy Val leu Lys He Val Thr Thr Val Ala Arg Ala 
53= 540 



2549 




V 



SEQ ID/No. 3 



Sequenced molecule: genomic DNA 
Name: GBSS gene f ragmen t/xrom potato 
Length of sequence: 492/bp 

GAG CTC TCC TGG AAG GTAAGTG/TGA ATTTGATAAT TTGCGTAGGT 
Glu Leu Ser Trp Lys 
565 



45 



ACTTCAGTTT GTTGTTCTCG TCASCACTGA TGGATTCCAA CTGGTGTTCT 
TGCAG GAA CCT (SCC AAG AAA TGG GAG ACA TTG 

Glu Pro Ala Lys Lys Trp Glu Thr Leu 
570/ 575 



95 
127 



CTA TTG GGC TTA GGA 
Leu Leu Glv Leu Gly 
5B0 



:T TCT GGC AGT GAA CCC GGT GTT GAA 
Qa Ser Gly Ser Glu Pro Gly Val Glu 
585 590 



169 



GGG 
Gly 



GAA 
Glu 



GAA 
Glu 



ATC 
lie 
595 



IT 
Xla 



la P 



CCA 
Pro 



CTT 
Leu 



GCC AAG 
Ala Lys 
600 



GAA AAT 
Glu Asn 



GTA GCC 
Val Ala 



ACT 
Thr 
605 



211 



CCT 
Pro 
606 



TAA 



ATGAGCTTTG GTTATCCTTG TTTCAACAAT AAGATCATTA 



257 



AGCAAACG7A, 
ATCATCTAC2 
TGTAAAA' 
TGGATCAA&G 
CTTAGTAXTG 



TTACTAGCG 
AAATGATTGG 
TGGTTAATGT 
T C AA TAG AAA 
TAGCATAAAT 



AACTATGTAG 
TTTTTGCTGG 
TTTTGTAGGT 
ATAGTTATTA 
AATAC TAGAA 



AACCCTATTA 
GGAGCAGCAG 
AAGGGCTATT 
CTAACGTTTG 
CTAGT 



TGGGGTCTCA 
CATATAAGGC 
TAAGGTGGTG 
CAACTAAATA 



307 
357 
407 
457 
492 




SEQ 




Sequenced molecule: genomid DNA 
Name: Promoter for the GBSS gene from potato 
Length of sequence: 98/ bp 



AAGCTTTAAC 
ACAAATGCAA 
TCTATTTGGC 
TTGACTCCTC 
CATTAAGTCC 
ATTTTTTTTT 
AAGCTCGTTA 
GTTGTTTTTA 
CCATGCATCT 
AAGA.TAAA G T 
CTTGTTTGTC 

aaagctaagg 
cagtacctag 
gagaca7agg 
t t ag a.ta.c t a 
g7ccagccg7 
tttcccta.ta 
ccgctattc1 
cacctcctcA 
tttactca 



GAGATAGAAA 
CAGTATCTTG 
TGTTGACGGA 
CGCCAGATAT 
TCATCTATGG 
CCTGCCAAGT 
AAAAAA. 

accatcct 
caatcttxat 
aatgta/tca 

G T AAA-3CAA T T 
GGGAMG TAAC 
ATA7TATTTT 
AA7GTCAAGT 
-GACAGAA 
-ATCAACAA 
TACAGTG TC 
CTTGACACGT 
TTCTCACTCA 
TCCTCCAATT 



atta/gttac 
tacgaaatcc 
gtaatcagga 
tjctgatttat 
t atactic a 
'ggaacggaga 
caataggaag 
ctttagca.gt 

ACTAAAATGC 
ACCTTTAGAA 
A G AAAA T A T A 
TAATATTCTA 
T AA T T A C T A. T 
GGTAGCGTAG 
CCGGACGGCC 
AGAGAGGGCC 
CACAGTTGCC 
GTCACTGAAA 
CTCACTCACA 
ATTTCTGATT 



TCCGTTTTGT 
TTTCTCTCTT 
TACAAACCAC 
GAATCCTCGA 
CAGTATCTTC 
CATGTTATGA 
AAA T G T AA C A 
GTATCAATTT 
AAC T TAATAT 
TTGTGCATTC 
T T TACAG T AA 
GTGGAGGGAG 
AA T AATAAT T 
GAGGGA.GTTG 
CATTGCAAGG 
CATAATACTG 
TTCTGCTAAG 
CCTGCTACAA 
CAGCTCAACA 
TCATGCA 



TCATTACTTA 
TTCAAACTTT 
AAGTATTTAA 
AAAGCCTATC 
CTGTTTGGGT 
TGTATACGGG 
AACATTGAAT 
TGTAATAGAA 
AGGCTAAACC 
ATAATTAGAT 
TTTGGAATAC 
GGACCAGTAC 
TAATTAACAC 
GTTTAGTTTT 
CC AAGTTGAA 
TCGATGAGCA 
GGATAGCCAC 
ATAAGGCAGG 
AGTGGTAACT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
550 
987 



SEQ Ig No* 5 



Sequenced molecule: genomic DNA 
Name: GBSS gene from potato 
Length of sequence: 496/4 bp 



AAGCTTTAAC GAGATAGAAA ATTATGTT^C TCCGTTTTGT TCATTACTTA 50 

ACAAATGCAA CAGTATCTTG T AC C AAA J C C TTTCTCTCTT TTCAAACTTT 100 

TCTATTTGGC TGTTGACGGA GTAATCAGGA TACAAACCAC AAGTATTTAA 150 

TTGACTCCTC CGCCAGATAT TATGATTTAT GAATCCTCGA AAAGCCTATC 2 00 

CATTAAGTCC TCATCTATGG ATATACTTGA CAGTATCTTC CTGTTTGGGT 250 

ATTTTTTTTT CCTGCCAAGT GGAAOfGGAGA CATGTTATGA TGTATACGGG 300 

AAGCTCGTTA AJLAAAAAATA CAATAGGAAG AAATGTAACA AACATTGAAT 350 

GTTGTTTTTA A.CCATCCTTC CTTTAGCAGT GTATCAATTT TGTAATAGAA 400 

CCATGCATCT CAATCTTAAT ACTAAAATGC AACTTAATAT AGGCTAAACC 4 50 

AAGATAAAGT AATGTATTCA AOCTTTAGAA TTGTGCATTC ATAATTAGAT 500 

CTTGTTTGTC GTAAAAAATT AjGAAAATATA TTTACAGTAA TTTGGAATAC 550 
AAAGCTAAGG GGGAAGTAAC 7 AA T A T T C T A GTGGAGGGAG GGACCAGTAC 600 
CAGTACC7AG ATATTATTTT yTAATTACTAT AATAATAATT TAATTAACAC 650 

GAGACATAGG AATGTCAAGT / GGTAGCGTAG GAGGGAGTTG GTTTAGTTTT 700 

TTAGATACTA GGAGACAGA?/ CCGGACGGCC CATTGCAAGG CCAAGTTGAA 750 
GTCCAGCCGT GAATCAACAA AGAGAGGGCC CATAATACTG TCGATGAGCA 8 00 

TTTCCCTATA ATACAGTGTC CACAGTTGCC TTCTGCTAAG GGATAGCCAC 850 
CCGCTATTCT CTTGACACST G T C AC T G AAA CCTGCTACAA ATAAGGCAGG 900 
CACCTCCTCA TTCTCACTCA CTCACTCACA CAGCTCAACA AGTGGTAACT 950 

TTTACTCATC TCCTCCA&TT ATTTCTGATT TCATGCATGT TTCCCTACAT 1000 

TCTATTATGA ATCGTG7TGT GG T G T AT AAA CGTTGTTTCA TATCTCATCT 1050 

CATCTATTCT GAT T TTG ATT CTCTTGCCTA CTGTAATCGG TGATAAATGT 1100 

GAATGCTTCC TTTCTTCTCA GAAATCAATT TCTGTTTTGT TTTTGTTCAT 1150 

CTGTAGCTTA TTCTyTGGTA GATTCCCCTT TTTGTAGACC ACACATCAC 1199 

ATG GCA AGC ATC/ ACA GCT TCA CAC CAC TTT GTG TCA AGA AGC 1241 
Met Ala Ser lis Thr Ala Ser His His ?he Val Ser Arg Ser 

1 / 5 10 

CAA ACT TCA OTA GAC ACC AAA TCA ACC TTG TCA CAG ATA GGA 1283 
Gin Thr Ser leu A.so Thr Lys Ser Thr Leu Ser Gin lie Gly 
15 / * 20 25 

CTC AGG AACT CAT ACT CTG ACT CAC AAT GGT TTA AGG GCT GTT 1325 
Leu Arg Asp. His Thr Leu Thr His Asn Gly Leu Arg Ala Val 
30 / 35 40 

AAC AAG GTT GAT GGG CTC CAA TCA ACA ACT AAT ACT AAG GTA 1367 
Asn Lys Leu Aso Gly Leu Gin Ser Thr Thr Asn Thr Lys Val 
/ 45 50 55 

ACA CCC AAG ATG GCA TCC AGA ACT GAG ACC AAG AGA CCT GGA 14 0 9 
Thr Prro Lys Met Ala Ser Arg Thr Glu Thr Lys Arg Pro Gly 
/ 60 65 70 

TGC TCA GCT ACC ATT GTT TGT GGA AAG GGA ATG AAC TTG ATC 14 51 
Cys/Ser Ala Thr lie Val Cys Gly Lys Gly Met Asn Leu lie 
/ 75 80 

TTT GTG GGT ACT GAG GTT GGT CCT TGG AGC AAA ACT GGT GGA 14 93 
^he Vai Gly Thr Glu Val Gly Pro Trp Ser Lys Thr Gly Gly 



CTA GGT GAT GTT CTT GGT GGA CTA CCA/ CCA GCC CTT GCA 
Leu Gly Asp Val Leu Gly Gly Leu Prp Pro Ala Leu Ala 
100 * 105 / 110 



1532 



GTAAGTCTTT CTTTCATTTG GTTACCTACT/ CATTCATTAC TTATTTTGTT 
TAGTTAGTTT CTACTGCATC AGTCTTTTT^ TCATTTAG GCC CGC GGA 

Ala Arg Gly 



1582 
1629 



CAT CGG GTA ATG ACA ATA TCC CCC CGT TAT GAC CAA TAC AAA 
His Arg Val Met Thr lie Ser/Pro Arg Tyr Asp Gin Tyr Lys 
115 120 / 125 



1671 



GAT GCT TGG GAT ACT GGC GTT GCG GTT GAG 
Asp Ala Tro Asd Thr Gly Val Ala Val Glu 
130 * /135 



GTACATCTTC 



1711 



CTATATTGAT ACGGTACAAT &TTGTTCTCT TACATTTCCT GATTCAAGAA 
TGTGATCATC TGCAG GTC AAA GTT GGA GAC AGC ATT GAA ATT GTT 

Val £ys Val Gly Asp Ser lie Glu He Val 
.40 145 



1761 
1806 



CGT TTC ITT CAC TGC/TAT AAA CGT GGG GTT GAT CGT GTT TTT 
Arg ?he ?r.e His Cys/ Tyr Lys Arg Gly Val Asp Arg Val Phe 
150 / 155 160 



1848 



GTT GAC CAC CCA ATG TTC TTG GAG AAA 
Val Asp His Pro Met Phe Leu Glu Lys 
165 / 170 



GTAAGCATAT 



1885 



TAT GAT TAT G AATCCGTCCT GAGGGATACG CAGAACAGGT CATTTTGAGT 
ATCTTTTAAC TCTACTGGTG CTTTTACTCT TTTAAG GTT TGG GGC AAA 

Val Trp Gly Lys 
175 



1935 
1983 



ACT GGT TCA AAA ATC TAT GGC CCC AAA GCT GGA CTA GAT TAT 
Thr Gly Ser yLys He Tyr Gly Pro Lys Ala Gly Leu Asp Tyr 

180 185 



2025 



CTG GAC AAT GAA CTT AGG TTC AGC TTG TTG TGT CAA 
Leu Asp Asn Glu Leu Arg Phe Ser Leu Leu Cys Gin 
190 / 195 200 



2061 



GTAAGTTAGT TACTCTTGAT TTTTATGTGG CATTTTACTC TTTTGTCTTT 
AATCGTTTTT TTAACCTTGT TTTCTCAG GCA GCC CTA GAG GCA CCT 

Ala Ala Leu Glu Ala Pro 
205 



2111 
2157 



AAA GjfT TTG AAT TTG AAC AGT AGC AAC TAC TTC TCA GGA CCA 
Lys yal Leu Asn Leu Asn Ser Ser Asn Tyr Phe Ser Gly Pro 
210 215 220 



2199 



30 



TAT G 
Tyr G 



GTAATTAACA CATCCTAGTT TCAG; 



^CT CCTTACTATA 



TCATTGTAGG TAATCATCTT TATTTTGCCT AjTTCCTGCAG GA GAG GAT 

ly Glu Asp 
225 



GTT CTC TTC ATT GCC AAT GAT 
Val Leu ?he lie Ala Asn Asp Tj 

230 



CAC ACA GCT CTC ATT CCT 
His Thr Ala Leu lie Pro 
235 



TGC TAC TTG AAG TCA ATG TAC /CAG TCC AGA GGA ATC TAC TTG 
Cys Tyr Leu Lys Ser Met Tyr/ Gin Ser Arg Gly lie Tyr Leu 
240 245 / 250 



AAT GCC AAG 
Asn Ala Lys 
255 



G T JJLAAAT TTC TTTGTATTCA CTCGATTGCA 



CGTTACCCTG CAAATCAGTA AGGTTGTATT AATATATGAT AAATTTCACA 
TTGCCTCCAG GTT GCT TTC TGC ATC CAT AAC ATT GCC TAC CAA 
Val Ala ?r/e Cys lie His Asn lie Ala Tyr Gin 
260 265 

GGT CGA TTT TCT TTQ TCT GAC TTC CCT CTT CTC AAT CTT CCT 
Gly Arg ?he Ser Phe Ser Aso ?he Pro Leu Leu Asn Leu Pro 
270 / " 275 280 

GAT GAA TTC AGG (£GT TCT TTT GAT TTC ATT GAT GGG TAT 
Aso Glu Phe Arg /Gly Ser Phe Aso Phe lie Asp Gly Tyr 
285/ * 230 

GTATTTAT3C TTGAAATCAG ACCTCCAACT TTTGAAGCTC TTTTGATGCT 
AGTAAATTGA GTJTTTAAAA TTTTGCAGAT AT GAG AAG CCT GTT AAG 

Lys Pro Val Lys 
295 

GGT AGG AAA/ ATC AAC TGG ATG AAG GCT GGG ATA TTA GAA TCA 
Gly Arg Lysf lie Asn Tro Met Lys Ala Gly lie Leu Glu Ser 
300 / 305 310 

CAT AGG G/iG GTT ACA GTG AGC CCA TAC TAT GCC CAA GAA CTT 
His Arg yal Val Thr Val Ser Pro Tyr Tyr Ala Gin Glu Leu 
.5 320 325 

GTC TCy GCT GTT GAC AAG GGA GTT GAA TTG GAC AGT GTC CTT 
Val Set Ala Val Asp Lys Gly Val Glu Leu Asp Ser Val Leu 



CAA GAG TGG AAC CCA GCG ACT GAC AAA T/AC ACA GAT GTC AAA 
Gin Glu Tro Asn Pro Ala Thr Asp Lys Tyr Thr Asp Val Lys 
355 * 360 / 365 



2895 



TAC GAT ATA ACC ACT 
Tyr Asp lie Thr Thr 
370 



GTAAGATAAG ATTTTTCCGA CTCCAGTATA 



2940 



TACTAAATTA TTTTGTATGT TTATGAAAT/f AAAGAGTTCT TGC TAATCAA 
AATCTCTATA CAG GTC ATG GAC GCA/AAA CCT TTA CTA AAG GAG 
Val Met Asp Alar Lys Pro Leu Leu Lys Glu 
375 / 380 



2990 
3033 



GCT CTT CAA GCA GCA GTT GGC/ TTG CCT GTT GAC AAG AAG ATC 
Ala Leu Gin Ala Ala Val Gly Leu Pro Val Asp Lys Lys lie 
385 39/0 395 



3075 



CCT TTG ATT GGC TTC ATC GGC AGA CTT GAG GAG CAG AAA GGT 
Pro Leu lie Gly Phe lie Gly Arg Leu Glu Glu Gin Lys Gly 
400 / 405 410 



3117 



TCA GAT ATT CTT GTT GCT GCA ATT CAC AAG TTC ATC GGA TTG 
Ser Aso lie Leu Ala Va/l A.la He His Lys Phe He Gly Leu 
415 / 420 425 



3159 



GAT GTT CAA ATT GTA GTC CTT 
Asp Val Gin He Val /Val Leu 

430/ 



GTAAGTACCA AATGGACTCA 



3200 



TGGTATCTCT CTTGTTGAGT TTACTTGTGC CGAAACTGAA ATTGACCTGC 
TACTCATCCT ATGCATjCAG GGA ACT GGC AAA AAG GAG TTT GAG 

Gly Thr Gly Lys Lys Glu Phe Glu 
435 440 



3250 
3293 



CAG GAG ATT GAA/ CAG CTC GAA GTG TTG TAC CCT AAC AAA GCT 
Gin Glu He Gly Gin Leu Glu Val Leu Tyr Pro Asn Lys Ala 

445 450 



3335 



AAA GGA GTG 
Lys Gly Val 
455 



;A AAA TTC AAT GTC CCT TTG GCT CAC ATG ATC 
.a Lys Phe Asn Val Pro Leu Ala His Met He 
460 465 



3377 



ACT GCT GG7 GCT GAT TTT ATG TTG GTT CCA AGC AGA TTT GAA 
Thr Ala GLy Ala Asp Phe Met Leu Val Pro Ser Arg Phe Glu 
470 / 475 480 



3419 



CCT TGT JGGT CTC ATT CAG TTA CAT GCT ATG CGA TAT GGA ACA 
Pro Cvs/Gly Leu He Gin Leu His Ala Met Arg Tyr Gly Thr 
65 430 495 



3461 



G T AAG AAC C A GAAGAGCTTG TACCTTTTTA CTGAGTTTTT AAJLAAAAGAA 
7CAT3AG--.CC TTGTTTTCCA TCTAAAGTTT AATAACCAAC TAAATGTTAC 
?GC?/gC?--.3Z T7TTCATT7C TGAAAATTGG TTATCTGATT TTAACGTAAT 



3511 
3561 
3611 



CACATGTGAG TCAG GTA CCA ATC TGT QCK TCG ACT GGT GGA CTT 

Val Pro lie Cys Khk Ser Thr Gly Gly Leu 
500 / 505 



3655 



GTT GAC ACT GTG AAA GAA GGC TAT ACT GGA TTC CAT ATG GGA 
Val Asp Thr Val Lys Glu Gly Tyr Thr Gly Phe His Met Gly 
510 / 515 520 



3697 



GCC TTC AAT GTT GAA 
Ala Phe Asn Val Glu 

525 



GTATGTGATT TTACATCAAT TGTGTACTTG 



3742 



TACATGGTCC ATTCTCGTCT TGATATACCC CTTGTTGCAT AAACATTAAC 
TTATTGCTTC TTGAATTTGG TTAG TGC GAT GTT GTT GAC CCA GCT 

Cys Asp Val Val Asp Pro Ala 

530 



3792 
3837 



GAT GTG CTT AAG ATA GTA ACA ACA GTT GCT AGA GCT CTT GCA 
As© Val Leu Lys He Vfel Thr Thr Val Ala Arg Ala Leu Ala 
535 / 540 545 



3879 



GTC TAT GGC ACC CTC/GCA TTT GCT GAG ATG ATA AAA AAT TGC 
Val Tyr Gly Thr Leiy Ala Phe Ala Glu Met He Lys Asn Cys 
550 / 555 560 



3S21 



ATG TCA GAG GAG CTC TCC TGG AAG GTAAGTGTGA AT T TGATAAT 
Met Ser Glu Glu l/eu Ser Tr© Lys 

65 



3965 



TTGCGTAGGT ACTTCAGTTT GTTGTTCTCG TCAGCACTGA TGGATTCCAA 
CTGGTGTTCT TGCA'G GAA CCT GCC AAG AAA TGG GAG ACA TTG 

Glu Pro Ala Lys Lys Trp Glu Thr Leu 
570 575 



4015 
4057 



CTA TTG GGC TTA GGA GCT TCT GGC AGT GAA CCC GGT GTT GAA 
Leu Leu Gly /Leu Gly Ala Ser Gly Ser Glu Pro Gly Val Glu 
530/ 585 590 



4099 



GGG GAA GAA ATC GCT CCA CTT GCC AAG GAA AAT GTA GCC ACT 
Gly Glu G^u He Ala Pro Leu Ala Lys Glu Asn Val Ala Thr 
595 600 605 



4141 



CCT TAA / ATGAGCTTTG GTTATCCTTG TTTCAACAAT AAG ATC ATT A 
Pro *** 
606 



4187 



AGCAASCGTA TTTACTAGCG AACTATGTAG AACCCTATTA TGGGGTCTCA 
ATCATCTACA AAATGATTGG TTTTTGCTGG GGAGCAGCAG CATATAAGGC 
TGTAAAATCC TGGTTAATGT TTTTGTAGGT AAGGGCTATT TAAGGTGGTG 
rCAAAC- TCAATAGAAA ATAGTTATTA CTAACGTTTG CAAC TAAA7A 
CTTAGTAATG 7AGCATAAAT AATACTAGAA CTAGTAGCTA A T A T A T A TGC 



— T H Zi ' 



4237 
4287 
4337 
4387 
4437 
4 4 67 
4537 

' - 5 "7 



/ 



33 

AGAAGTAATC AAATTCAAAT TAGTTGTTTG GTCATATGAA AGAAGCTGCC 4 637 

AGGCTAACTT TGAGGAGATG GCTATTGAAT/TTCAAAATGA TTATGTGAAA 4 687 

ACAATGCAAC ATCTATGTCA ATCAACAC7T AAATTATTGC ATTTAGAAAG 4737 

ATATTTTTGA GCCCATGACA CATTCATTCA TAAAGTAAGG TAGTATGTAT 4 7 87 

GATTGAATGG ACTACAGCTC AATCAXAGCA TCTCCTTTAC ATAACGGCAC 4 837 
TGTCTCTTGT CTACTACTCT ATTGGTAGTA GTAGTAGTAA TTTTACAATC ' 4 887 

CAAATTGAAT AGTAATAAGA TGGTCTCTAT TTACTAAAGT AGTAGTATTA 4 937 

TTCTTTCGTT ACTCTAAAGC AftCAAAA 4 964 



